Typical Sequences for Polish Alphabets

نویسنده

Patrick Mitran

چکیده

The notion of typical sequences plays a key role in the theory of information. Central to the idea of typicality is that a sequence x1, x2, . . . , xn that is PX -typical should, loosely speaking, have an empirical distribution that is in some sense close to the distribution PX . The two most common notions of typicality are that of strong (letter) typicality and weak (entropy) typicality. While weak typicality allows one to apply many arguments that can be made with strongly typical arguments, some arguments for strong typicality cannot be generalized to weak typicality. In this paper, we consider an alternate definition of typicality, namely one based on the weak* topology and that is applicable to Polish alphabets (which includes R). This notion is a generalization of strong typicality in the sense that it degenerates to strong typicality in the finite alphabet case, and can also be applied to mixed and continuous distributions. Furthermore, it is strong enough to prove a Markov lemma, and thus can be used to directly prove a more general class of results than weak typicality. As an example of this technique, we directly prove achievability for Gel’fand-Pinsker channels with input constraints for a large class of alphabets and channels without first proving a finite alphabet result and then resorting to delicate quantization arguments. While this large class does not include Gaussian distributions with power constraints, it is shown to be straightforward to recover this case by considering a sequence of truncated Gaussian distributions. Index Terms Typical sequences, weak* topology, capacity, Gel’fand-Pinsker.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Practical Rank/Select Queries over Arbitrary Sequences

We present a practical study on the compact representation of sequences supporting rank, select, and access queries. While there are several theoretical solutions to the problem, only a few have been tried out, and there is little idea on how the others would perform, especially in the case of sequences with very large alphabets. We first present a new practical implementation of the compressed...

متن کامل

Reduced amino acid alphabets exhibit an improved sensitivity and selectivity in fold assignment

MOTIVATION Many proteins with vastly dissimilar sequences are found to share a common fold, as evidenced in the wealth of structures now available in the Protein Data Bank. One idea that has found success in various applications is the concept of a reduced amino acid alphabet, wherein similar amino acids are clustered together. Given the structural similarity exhibited by many apparently dissim...

متن کامل

Typicality Graphs:Large Deviation Analysis

Let X and Y be finite alphabets and PXY a joint distribution over them, with PX and PY representing the marginals. For any ε > 0, the set of n-length sequences x and y that are jointly typical [1] according to PXY can be represented on a bipartite graph. We present a formal definition of such a graph, known as a typicality graph, and study some of its properties.

متن کامل

Statistics of RNA secondary structures.

A statistical reference for RNA secondary structures with minimum free energies is computed by folding large ensembles of random RNA sequences. Four nucleotide alphabets are used: two binary alphabets, AU and GC, the biophysical AUGC and the synthetic GCXK alphabet. RNA secondary structures are made of structural elements, such as stacks, loops, joints, and free ends. Statistical properties of ...

متن کامل

A formalism for studying long - range correlations in many - alphabets sequences

S. L. Narasimhan, Joseph A. Nathan, P. S. R. Krishna and K. P. N. Murthy Solid State Physics Division, Reactor Physics Design Division Bhabha Atomic Research Centre, Mumbai-400085, India. Materials Science Division, Indira Gandhi Centre for Atomic Research, Kalpakkam 603102, Tamilnadu, India. Abstract We formulate a mean-field-like theory of long-range correlated L-alphabets sequences, which ar...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

CoRR

دوره abs/1005.2321 شماره

صفحات -

تاریخ انتشار 2010

Typical Sequences for Polish Alphabets

نویسنده

چکیده

منابع مشابه

Practical Rank/Select Queries over Arbitrary Sequences

Reduced amino acid alphabets exhibit an improved sensitivity and selectivity in fold assignment

Typicality Graphs:Large Deviation Analysis

Statistics of RNA secondary structures.

A formalism for studying long - range correlations in many - alphabets sequences

عنوان ژورنال:

اشتراک گذاری